2D1431 Machine Learning Lab 4: Reinforcement Learning
نویسندگان
چکیده
In this lab you will learn about dynamic programming and reinforcement learning. It is assumed that you are familiar with the basic concepts of reinforcement learning and that you have read chapter 13 in the course book Machine Learning (Mitchell, 1997). The first four chapters of the survey on reinforcement learning by Kaelbling et al. (1996) is a good supplementary material. For further reading and a detailed discussion of policy iteration and reinforcement learning, the textbook “Reinforcement Learning” is highly recommendable (Sutton and Barto, 1999). In particular studying chapters 3,4 and 6 is of immense help for this lab. The predefined Matlab functions for this lab are located in the course directory /info/mi03/labs/lab4. Dynamic programming refers to a class of algorithms that can be used to compute optimal policies given a complete model of the environment. Dynamic programming solves problems that can be formulated as Markov decision processes. Unlike in the reinforcement learning case, dynamic programming assumes that the state transition and reward functions are known. The central idea of dynamic programming and reinforcement learning is to learn value functions, which in turn can be used to identify the optimal policy.
منابع مشابه
2D1431 Machine Learning Lab 3: Reinforcement Learning
In this lab you will learn about dynamic programming and reinforcement learning. It is assumed that you are familiar with the basic concepts of reinforcement learning and that you have read chapter 13 in the course bookMachine Learning (Mitchell, 1997). The first four chapters of the survey on reinforcement learning by Kaelbling et al. (1996) is a good supplementary material. For further readin...
متن کامل2d1431 Machine Learning Lab 3: Instance Based Learning & Neural Networks
In this lab you will learn about instance based learning algorithm (locally weighted regression) and artificial neural networks and apply both techniques to function approximation. You will also learn how to use crossvalidation for parameter and feature selection. You will have to implement the code for locally weighted regression and cross-validation. You will use some existing code for the ba...
متن کامل2D1431 Machine Learning Lab 2: Bayes Classifier & Boosting
In this lab you will implement a Bayes Classifier and the Adaboost algorithm that improves the performance of a weak classifier by aggregating multiple hypotheses generated across different distributions of the training data. Some predefined functions for visualization and basic operations are provided, but you will have to program the key algorithms yourself. During the examination with the la...
متن کامل2D1431 Machine Learning Lab 1: Concept Learning & Decision Trees
You have to prepare the solutions to the lab assignments prior to the scheduled labs, which are mainly for examination. In order to pass the lab you present your program and answers to the question to the assistent. Labs can be presented in groups of two, however both students need to fully understand the entire solution and answers. It is also assumed that you complete the assignment on your o...
متن کاملIncremental Machine Learning to Reduce Biochemistry Lab Costs in the Search for Drug Discovery
This paper promotes the use of supervised machine learning in laboratory settings where chemists have a large number of samples to test for some property, and are interested in identifying as many positive instances for the least laboratory testing effort. Rather than traditional supervised learning where the chemists would first develop a large training set and then train a classifier, the pap...
متن کامل